Extending SQL for Decision Support Applications

Authors

  • Haixun Wang
  • Carlo Zaniolo
Abstract

Extended Abstract. The challenge of extending database systems for decision support applications has been the topic of much recent research; a very incomplete list of previous work includes [11, 8, 12, 4, 10, 5]. Yet there is no generally accepted solution to the problem, which remains a critical one, since the inability of current DBMSs to support data mining applications is well-tested and clearly documented [12]. Our research approach to this difficult problem is motivated by the observation that aggregate functions provide the linchpin for most decision support computations; moreover, inductive discovery from large data sets can be viewed as the process of aggregating low-level data into statistical summaries of semantic significance. Therefore, the ATLaS system designed at UCLA [2] allows end users to define powerful new aggregate functions by writing them in SQL. The same mechanism can be used to define new table functions in ATLaS, whose name stands for Aggregate & Table Language and System. ATLaS is the successor of the AXL system described in [15]. These SQL-based native extension mechanisms turn ATLaS into a powerful and flexible system for advanced data-intensive applications, including applications from many domains that are not supported well by current Object-Relational database systems, which still suffer from limited extensibility. In fact, the only extensibility mechanism now provided by Object-Relational systems relies on nonnative datablades, i.e., on external functions defined in a procedural language and imported into SQL. ATLaS is very effective at expressing decision support tasks: we demonstrate this property by the efficient implementation of several functions, such as rollups, datacubes, classifiers, and frequent item sets for association rules [2]. The performance of these functions expressed in ATLaS is typically within 30% of the performance of the same algorithms coded in C/C++.
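To make the contrast concrete, the following is a minimal sketch of the nonnative style of extension described above: an aggregate written in a procedural language and then imported into SQL. It uses Python's standard `sqlite3` module purely as a stand-in, not an Object-Relational system with datablades and not ATLaS itself. The `ValueRange` class, the `value_range` function name, and the `sales` table are hypothetical names chosen for illustration.

```python
import sqlite3


class ValueRange:
    """External aggregate: max(value) - min(value), ignoring NULLs."""

    def __init__(self):
        self.lo = None
        self.hi = None

    def step(self, value):
        # Called once per input row of the group.
        if value is None:
            return
        self.lo = value if self.lo is None else min(self.lo, value)
        self.hi = value if self.hi is None else max(self.hi, value)

    def finalize(self):
        # Called once after the last row; returns the aggregate result.
        if self.lo is None:
            return None
        return self.hi - self.lo


def demo():
    conn = sqlite3.connect(":memory:")
    # Import the procedural definition into SQL under a (hypothetical) name.
    conn.create_aggregate("value_range", 1, ValueRange)
    conn.execute("CREATE TABLE sales (region TEXT, amount REAL)")
    conn.executemany(
        "INSERT INTO sales VALUES (?, ?)",
        [("east", 10.0), ("east", 25.0), ("west", 7.0), ("west", 7.5)],
    )
    rows = conn.execute(
        "SELECT region, value_range(amount) FROM sales "
        "GROUP BY region ORDER BY region"
    ).fetchall()
    conn.close()
    return rows


if __name__ == "__main__":
    print(demo())
```

In this style the aggregate's logic lives outside SQL; the abstract's point is that ATLaS instead lets the same kind of function be written in SQL itself.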
To achieve this level of performance, ATLaS supports various optimization techniques and the ability to manipulate in-memory tables in SQL. In fact, attributes of reference type in such tables allow the efficient support of data structures, such as tries, that are instrumental in implementing data mining algorithms, such as Apriori [1, 7]. The stream-oriented computation model used by ATLaS contrasts with the computation model based on 'blocking' semantics that is normally used for aggregates in current database systems. Thus, online aggregates [9], time-series queries [13], sliding-window aggregates, approximate aggregates, and continuous queries [3] are naturally supported in ATLaS. Furthermore, important properties of an ATLaS program, such as blocking behavior and monotonicity, …
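The difference between 'blocking' and stream-oriented aggregates can be illustrated with a small sketch. This is a Python analogy under stated assumptions, not ATLaS syntax: the function names `blocking_avg`, `online_avg`, and `sliding_avg`, and the `every` and `window` parameters, are hypothetical and do not come from the paper.

```python
from collections import deque
from typing import Iterable, Iterator, Optional


def blocking_avg(values: Iterable[float]) -> Optional[float]:
    """Blocking aggregate: no result until all input has been consumed."""
    total, count = 0.0, 0
    for v in values:
        total += v
        count += 1
    return total / count if count else None


def online_avg(values: Iterable[float], every: int = 2) -> Iterator[float]:
    """Online aggregate: emits a running average every `every` tuples,
    plus a final value if the input ends between emissions."""
    total, count = 0.0, 0
    for v in values:
        total += v
        count += 1
        if count % every == 0:
            yield total / count
    if count and count % every != 0:
        yield total / count


def sliding_avg(values: Iterable[float], window: int = 2) -> Iterator[float]:
    """Sliding-window aggregate: average of the last `window` tuples,
    emitted after every input tuple."""
    buf = deque()
    total = 0.0
    for v in values:
        buf.append(v)
        total += v
        if len(buf) > window:
            total -= buf.popleft()
        yield total / len(buf)


if __name__ == "__main__":
    data = [10.0, 20.0, 30.0, 40.0, 50.0]
    print(blocking_avg(data))
    print(list(online_avg(data, every=2)))
    print(list(sliding_avg([1.0, 2.0, 3.0, 4.0], window=2)))
```

Because the online and sliding-window versions are generators, they produce results while input is still arriving, which is the property that makes them usable on unbounded streams; the blocking version can only return once its input ends.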

Similar resources

ATLaS: A Native Extension of SQL for Data Mining

A lack of power and extensibility in their query languages has seriously limited the generality of DBMSs and hampered their ability to support data mining applications. Thus, there is a pressing need for more general mechanisms for extending DBMSs to efficiently support database-centric data mining applications. To satisfy this need, we propose a new extensibility mechanism for SQL-compliant D...


MySQL Data Mining: Extending MySQL to Support Data Mining Primitives (Demo)

The development of predictive applications built on top of knowledge bases is growing rapidly; as a result, database systems, especially commercial ones, are being equipped with native data mining analytical tools. In this paper, we present an integration of data mining primitives on top of MySQL 5.1. In particular, we extended MySQL to support frequent itemsets computation and classification based ...


Database System Extensions for Decision Support: the AXL

Research on database-centric data mining is seeking to improve the effectiveness of database systems in decision support applications. Different solutions are now used for different problems, including (i) SQL extensions for more complex OLAP queries, (ii) new datablades for special data types such as time-series, and (iii) architectural extensions to support data mining functions. Here, we propos...


Database System Extensions for Decision Support: the AXL Approach

Research on database-centric data mining is seeking to improve the effectiveness of database systems in decision support applications. Different solutions are now used for different problems, including (i) SQL extensions for more complex OLAP queries, (ii) new datablades for special data types such as time-series, and (iii) architectural extensions to support data mining functions. Here, we propos...


ATLaS: A Native Extension of SQL for Data Mining and Stream Computations

A lack of power and extensibility in their query languages has seriously limited the generality of DBMSs and hampered their ability to support new application domains. Considerable efforts by database researchers and commercial DBMS vendors have led to major extensions; yet there remain important applications—particularly data mining—that are not supported well in SQL-3. Thus, there is a pressi...



Publication date: 2002